IMAGE SEQUENCE DESCRIPTION USING SPATIOTEMPORAL FLOW CURVES: TOWARD MOTION-BASED RECOGNITION By
نویسنده
چکیده
Recovering a hierarchical motion description of a long image sequence is one way to recognize objects and their motions. Intermediate-level and high-level motion analysis, i.e., recognizing a coordinated sequence of events such as walking and throwing, has been formulated previously as a process that follows high-level object recognition. This thesis develops an alternative approach to intermediate-level and high-level motion analysis. It does not depend on complex object descriptions and can therefore be computed prior to object recognition. Toward this end, a new computational framework for low and intermediate-level processing of long sequences of images is presented. Our new computational framework uses spatiotemporal (ST) surface ow and ST ow curves. As contours move, their projections into the image also move. Over time, these projections sweep out ST surfaces. Thus, these surfaces are direct representations of object motion. ST surface ow is de ned as the natural extension of optical ow to ST surfaces. For every point on an ST surface, the instantaneous velocity of that point on the surface is recovered. It is observed that arc length of a rigid contour does not change if that contour is moved in the direction of motion on the ST surface. Motivated by this observation, a function measuring arc length change is de ned. The direction of motion of a contour undergoing motion parallel to the image plane is shown to be perpendicular to the gradient of this function. ST surface ow is then used to recover ST ow curves. ST ow curves are de ned such that the tangent at a point on the curve equals the ST surface ow at that point. ST ow curves are then grouped so that each cluster represents a temporally-coherent structure, i.e., structures that result from an object or surface in the scene undergoing motion. Using these clusters of ST ow curves, separate moving objects in the scene can be hypothesized and occlusion and disocclusion between them can be identi ed. The problem of detecting cyclic motion, while recognized by the psychology community, has received very little attention in the computer vision community. In order to show the representational power of ST ow curves, cyclic motion is detected using ST ow curves without prior recovery of complex object descriptions.
منابع مشابه
Matching mixtures of curves for human action recognition
A learning-based framework for action representation and recognition relying on the description of an action by time series of optical flow motion features is presented. In the learning step, the motion curves representing each action are clustered using Gaussian mixture modeling (GMM). In the recognition step, the optical flow curves of a probe sequence are also clustered using a GMM, then eac...
متن کاملAction Recognition by Matching Clustered Trajectories of Motion Vectors
A framework for action representation and recognition based on the description of an action by time series of optical flow motion features is presented. In the learning step, the motion curves representing each action are clustered using Gaussian mixture modeling (GMM). In the recognition step, the optical flow curves of a probe sequence are also clustered using a GMM and the probe curves are m...
متن کاملMatching Mixtures of Trajectories for Human Action Recognition
A learning-based framework for action representation and recognition relying on the description of an action by time series of optical flow motion features is presented. In the learning step, the motion curves representing each action are clustered using Gaussian mixture modeling (GMM). In the recognition step, the optical flow curves of a probe sequence are also clustered using a GMM, then eac...
متن کاملSpatiotemporal segmentation using genetic algorithms
Segmentation is the process of identifying uniform regions based on certain conditions. Segmentation has been used for a long time in image analysis and computer vision for a variety of applications. In particular, there has been a growing interest in video sequence segmentation mainly due to the development of MPEG-4, which enables the content-based manipulation of multimedia data [1,2]. For t...
متن کاملObject recognition using spatiotemporal signatures
The sequence of images generated by motion between observer and object specifies a spatiotemporal signature for that object. Evidence is presented that such spatiotemporal signatures are used in object recognition. Subjects learned novel, three-dimensional, rotating objects from image sequences in a continuous recognition task. During learning, the temporal order of images of a given object was...
متن کامل